Lip Reading Using Convolutional Neural Networks with and without Pre-Trained Models
نویسندگان
چکیده
منابع مشابه
Lip-Reading using Neural Networks
Lip-Reading has been practiced over centuries for teaching deaf and dumb to speak and communicate effectively with the other people. In this study, the use of neural networks in lip reading is explored. We convert the video of the subject speaking different words into images and then images are further selected manually for processing. As per the research the horizontal and the vertical distanc...
متن کاملLip-Reading using Neural Networks
Student of Computer Science Engineering, Lingaya’s Institute of Mgt. & Tech., Faridabad, 121002 India †† Student of Computer Science Engineering, Lingaya’s Institute of Mgt. & Tech., Faridabad, 121002 India ††† Student of Computer Science Engineering, Lingaya’s Institute of Mgt. & Tech., Faridabad, 121002 India Faculty of Computer Science Engineering, Lingaya’s University, Faridabad, 121002 Ind...
متن کاملRemote Sensing Scene Classification Based on Convolutional Neural Networks Pre-Trained Using Attention-Guided Sparse Filters
Semantic-level land-use scene classification is a challenging problem, in which deep learning methods, e.g., convolutional neural networks (CNNs), have shown remarkable capacity. However, a lack of sufficient labeled images has proved a hindrance to increasing the land-use scene classification accuracy of CNNs. Aiming at this problem, this paper proposes a CNN pre-training method under the guid...
متن کاملStudying Invariances of Trained Convolutional Neural Networks
Convolutional Neural Networks (CNNs) define an exceptionally powerful class of models for image classification, but the theoretical background and the understanding of how invariances to certain transformations are learned is limited. In a large scale screening with images modified by different affine and nonaffine transformations of varying magnitude, we analyzed the behavior of the CNN archit...
متن کاملGlobally Trained Handwritten Word Recognizer Using Spatial Representation, Convolutional Neural Networks, and Hidden Markov Models
Yann Le Cun AT&T Bell Labs Holmdel NJ 07733 We introduce a new approach for on-line recognition of handwritten words written in unconstrained mixed style. The preprocessor performs a word-level normalization by fitting a model of the word structure using the EM algorithm. Words are then coded into low resolution "annotated images" where each pixel contains information about trajectory direction...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Balkan Journal of Electrical and Computer Engineering
سال: 2019
ISSN: 2147-284X
DOI: 10.17694/bajece.479891